Eeciency Considerations in Very Large Information Retrieval Servers
نویسندگان
چکیده
It is estimated that the World Wide Web now contains more than twenty million diierent content areas, presented on more than 320 million web pages, and one million web servers| and it is doubling every nine months 16, 17]. To combat this, Moore's law suggests that computational resource will continue to double every eighteen months. This suggests that| although both curves are exponential|we may be losing ground in the quest to search and nd useful information. We brieey review some of the algorithms known to be useful for very large scale information retrieval and suggest opportunities for improvement in this area. Without research in this area, the web and other electronic information sources will become large haystacks in which no one will be able to nd any needles.
منابع مشابه
Eeciency Considerations for Scalable Information Retrieval Servers
We overview a variety of techniques to improve eeciency in information retrieval. Given the increasing volumes of data that are available electronically, understanding and using such techniques is critical. We address several eeciency concerns, but our primary focus is on index processing since it dominates the computational demands of information retrieval. Given the importance of index proces...
متن کاملBehavioral Considerations in Developing Web Information Systems: User-centered Design Agenda
The current paper explores designing a web information retrieval system regarding the searching behavior of users in real and everyday life. Designing an information system that is closely linked to human behavior is equally important for providers and the end users. From an Information Science point of view, four approaches in designing information retrieval systems were identified as system-...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملImproved Skips for Faster Postings List Intersection
Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval system is fundamentally based on: Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...
متن کاملPerformance Evaluation of Medical Image Retrieval Systems Based on a Systematic Review of the Current Literature
Background and Aim: Image, as a kind of information vehicle which can convey a large volume of information, is important especially in medicine field. Existence of different attributes of image features and various search algorithms in medical image retrieval systems and lack of an authority to evaluate the quality of retrieval systems, make a systematic review in medical image retrieval system...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999